RESEARCH ARTICLE 


7 


a ~ 
YW 


SPECIAL ISSUE 


International Research Journal on Advanced Science Hub 
2582-4376 

Vol. 05, Issue 05S May 

www.rspsciencehub.com 


® Check for updates 


http://dx.doi.org/10.47392/irjash.2023.S045 


RSP Science Hub 
International Conference on intelligent COMPUting TEchnologies and Research (i-COMPUTER) 2023 


Predict Customer Churn through Customer Behaviour using Machine 
Learning Algorithms 


Harini T !, Hari Krishna T 7, Sushma G ', Charankumar Reddy P 1’ Mohammad Thahir S ! 


"Department of Computer Science and Engineering, Annamacharya Institute of Technology and Sciences, Rajampet, Andhra 
Pradesh, India. 

? Associate Professor & Head, Department of Artificial Intelligence and Machine Learning, Annamacharya Institute of Technology 
and Sciences, Rajampet, Andhra Pradesh, India. 


Email: thimmareddyharini2006 @ gmail.com 


Abstract 

Customers are becoming more concerned to the quality of service (QoS) offered 
by organizations in the present. However, the present day shows greater rivalry 
in offering the clients with technologically innovative QoS. However, an orga- 
nization may benefit from effective customer relationship management systems 
in order to increase sales, maintain relationships with existing customers and 
improve customer retention. The customer retention strategies can benefit 
greatly by the use of machine learning models like Decision Tree, Naive-Bayes 
Classification, Logistic Regression algorithms. 
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1. Introduction 


Every organization’s profit and revenue are always 
increased by its customers, so it is crucial for organ- 
isational managers to maintain an effective cus- 
tomer relationship management system by identi- 
fying their target clients and building strong rela- 
tionships with them. (Brandusoiu, Toderean, et 
al.) Additionally, the company will benefit from 
the CRM system by being able to recognise the 
most significant client group and their behaviour 
which helps to better understand its retention strat- 
egy. Lower customer churn rates are associated 
with stronger customer loyalty, therefore apply- 
ing machine learning algorithms like the Decision 
Tree, Naive-Bayes Classification, Logistic Regres- 
sion algorithms can help to prevent the customer 
churn. (Maldonado et al.) 
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2. Background 


Churning can occur voluntarily where a customer 
chooses to end their relationship with a specific 
seller or involuntarily where the customer’s act 
forces them to avoid engaging in any kind of busi- 
ness relationship with a specific entity. (Tsai, Lu, 
et al.) Forceful churning occurs when a customer 
violates the traditional rules for conducting business 
within a specific location. In this situation, it may 
be the result of illegal activities like theft or a fail- 
ure to make a timely payment. A voluntary kind of 
churning is sparked by issues related to consumer 
satisfaction or the loss of an organizations competi- 
tive edge. (Burez and Van Den Poel) Any company 
that deals with a variety of consumers should regu- 
larly rotate its evaluation practices. 
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3. Causes of Churning 


The influence by former customers on current con- 
sumers who are willing to continue doing business 
with a company has a negative effect on the amount 
of money a company receives for a certain service or 
good. Customers may have unrealistic expectations 
that are not met when they buy products from a cer- 
tain company, as evidenced by their preference for 
newer products. For the corporations that manufac- 
tures machinery is difficult to persuade customers 
about a certain machine when they test it and dis- 
cover it underperformed. (Hossain, Suchy, et al.) If 
the device meets additional requirements, the con- 
sumer will choose a different seller. Particularly 
in these situations, consumer management strate- 
gies pay off, prompting the organisation to increase 
the quality of it production and expand their brand 
portfolio. (Vafeiadis et al.) Price is one of the pri- 
mary factors influencing consumer decisions to buy 
a particular good. More than any other factor, cus- 
tomers feel the pressure on their sale of premium 
services and goods when they have choices to less 
costly products that can perform the intended duty. 
Price discrepancies in the product market destroy a 
company’s ability to retain customers in compari- 
son to that of their rivals. According to the hedo- 
nic pricing theory, customers assess the value of a 
product by comparing its price to that of competing 
businesses. (Kaya et al.) A corporation’s goods are 
at danger of being rejected by potential customers 
if the pricing plan is thought to be flawed by the 
general public. Customer happiness is the primary 
determinant of whether a certain consumer would 
adhere to the terms of a particular business or reject 
the offer. Idris, Rizwan, Khan, et al.) Customers are 
satisfied if their expectations are met or exceeded by 
the performance of the specific good or service they 
are about to purchase. In most circumstances cus- 
tomers prefer to stick with the company that offer 
top-notch customer care services. Once a consumer 
develops the habit of purchasing goods from a cer- 
tain seller, the company has won the customer’s loy- 
alty, and it is up to the company to do all reasonable 
efforts to keep the customer. (Baghla, Gupta, et al.) 


4. Conceptual Framework 


Aspects of the marketing mix theory provide 
stronger justifications for why clients decide to leave 
a certain company hub. Customers’ impressions 
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of a certain corporation are overturned by product 
attributes, particularly the worth and pricing. For 
instance, a research in the Australian telecommuni- 
cations sector established that the consumers’ choice 
to decline utilising local networks was supported 
by the quality of the network they utilised. Due 
to the vast number of infrequent small scale cus- 
tomers who just phone and use the internet, poor 
network quality almost always affects the majority 
of patrons. Selling prices and advertising have a 
tremendous influence on consumer behaviour in any 
industry. When a business offers its clients advan- 
tageous rates, their loyalty will grow and stay con- 
stant. (Farquad, Ravi, Raju, et al.) 


The anticipation and confirmation hypothesis fre- 
quently offers a crystal-clear explanation for why 
consumers act in particular ways. The broad strokes 
of typical customer behaviour are seen in Figure 1. 
AS was previously stated, a customer’s expectations 
greatly affect whether they would purchase a par- 
ticular product, and it is the obligation of a brand ’s 
marketing team to ensure that the pricing are reason- 
able. When customers use a service and determine 
that it complies with standards, their loyalty to a par- 
ticular company always grows. (Siu, Zhang, Yau, et 
al.) 


5. Churn Prediction 


When monitoring a company’s _ sustainability 
turnover rate, churn analysis is implicit. Churn is 
a term used to describe the number of customers 
who cancelled their subscriptions in a given period 
of time in the telecom industry. The amount of 
customers who arrive and exit over a specific time 
period is measured as the churn rate. Additionally, 
the shifting of consumers from one business to 
another is referred to as churn in the telecommu- 
nications sector. As the particular industry works 
hard to keep more profitable clients, the current 
situation is showing a bigger quantity of churning 
customers. Figure 2 depicts the algorithm used 
to train your data set and model. The churn can 
also be divided into two categories. Involuntary 
churn is the term used when an industry decides 
to lose customers due to bill paying issues, fraud, 
or another similar issue. Contrarily, the specific 
behaviour turns into a voluntary model of potential 
customer behaviour. (Ullah et al.) 
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FIGURE 1. The broad strokes of typical cus- 
tomer behaviour 
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FIGURE 2. An algorithmic flow chart 
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6. Prediction of the churning through a machine 
learning algorithm 


The quickest way to spot client turnover in a firm 
is through data analysis. When deciding on the 
appropriate course of action to be followed by a 
corporation to sustain a regular trade environment 
in the product market, for example, using con- 
sumer databases is made easier by employing E- 
commerce. The knowledge of consumer behaviour 
in relation to their data pre-processing facilities 
is highlighted for a producer. For instance, it is 
much preferable to use a machine learning algo- 
rithm when conducting a regression study that will 
show how different businesses and goods affect con- 
sumer behaviour. The managing team should be 
persuaded to take action against customer fraud by 


using graphical presentations of the results, it is 
advised. Tools for analysing analytical data should 
be used, particularly when providing the marketing 
team with an instant snapshot of ongoing usual mar- 
ket behaviour. When analysing a company’s trading 
behaviour, categorization is crucial in determining 
its strong point and weak point. Additionally, the 
variable distribution tables will clearly demonstrate 
the mean and mode features of the purchasing teams 
in a larger context. 


7. Decision Tree 


A decision tree is a tree structure that resembles a 
flowchart, in which each leaf node symbolises the 
result and each interior node represents a feature (or 
attribute). A decision tree’s root node is the first 
node from the top. Based on the attribute value, 
it learns to partition. It is known as recursive par- 
titioning and it divides the tree in this way. Your 
decision-making is aided by the flowchart. It is an 
image that, like a flowchart diagram, faithfully rep- 
resents how people think. It is easy to grasp and 
interpret decision trees. 


8. Logistical Regression 


In the early 20th century, the biological sciences 
employed logistic regression. Later, it was put to 
use in numerous social science contexts. When the 
target variable (dependent variable) is categorical, 
logistic regression is employed. To identify spam in 
an email, either (0). whether the tumour is malig- 
nant(1) or not(O). Think of a situation where we 
must categorise if an email is spam or not. Setting 
up a threshold on which classification can be per- 
formed is necessary if we utilise linear regression 
to solve this problem. If the actual class is mali- 
cious, the estimated constant value is 0.4, and the 
minimum value is 0.5, the data point will be cate- 
gorised as not malicious, which could have signifi- 
cant effects in real time. This example demonstrates 
that categorization issues do not lend itself to lin- 
ear regression. Due of the limitless nature of lin- 
ear regression, logistic regression enters the picture. 
They just range from 0 to 1, and nothing else. 


9. Naive Bayes Classifier 


Naive Bayes classifiers are a subset of Bayes 
Theorem-based classification methods. It is a col- 
lection of algorithms rather than a single method, all 
of which follow the same essential tenet: that each 
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pair of characteristics being categorised is indepen- 
dently of the others. To get things going, let’s con- 
sider a dataset. Consider a fictitious dataset that 
describes the optimum conditions for a round of 
golf. Based on the present conditions, each tuple 
classifies whether the weather is suitable for playing 
golf as ’Yes” or ”No”. 


10. Conclusion 


As a result of the discussion above, it can be said 
that any organisation, whatever of its form, needs 
to be concerned about customer turnover. Cus- 
tomer retention is the process of preserving a cus- 
tomer’s loyalty through comprehending their needs 
and meeting them appropriately. The organisational 
management will be helped by a strong churn pre- 
diction model to predict consumer churn. 

Decision Tree, Naive-Bayes Classification, 
Logistic Regression may be useful for estimating 
the turnover rate depending on the complicated data 
of the telecommunication sector. The concept of 
client retention as well as the churn forecast were 
the main topics of the mentioned paper. Along 
with the technique, the usage of Decision Tree, 
Naive Bayes Classification, Logistic Regression is 
to improve the churn prediction process has also 
been covered here. 
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